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Abstract 

We study the quantum complexity of the static set membership problem: given a 
subset S (\S\ < n) of a universe of size m (S> n), store it as a table, T : {0, l} r — > 
{0, 1}, of bits so that queries of the form 'Is x in 5?' can be answered. The goal is to 
use a small table and yet answer queries using few bit probes. This problem was con- 
sidered recently by Buhrman, Miltersen, Radhakrishnan and Venkatesh [ BMRV0C| ] , 



who showed lower and upper bounds for this problem in the classical deterministic 
and randomised models. In this paper, we formulate this problem in the "quantum 
bit probe model". We assume that access to the table T is provided by means of a 
black box (oracle) unitary transform Ot that takes the basis state \y, b) to the basis 
state \y, 6©T(y)}. The query algorithm is allowed to apply Ot on any superposition 
of basis states. 

We show tradeoff results between space (defined as 2 r ) and number of probes (or- 
acle calls) in this model. Our results show that the lower bounds shown in | BMRV0C| 1 



for the classical model also hold (with minor differences) in the quantum bit probe 
model. These bounds almost match the classical upper bounds. Our lower bounds 
are proved using linear algebraic arguments. 

Keywords: Data structures, set membership, bit probe model, quantum black box model, 
linear algebraic methods, lower bounds, space-time tradeoffs. 



1 Introduction 

In this paper we study the static membership problem: Given a subset S (\S\ < n) of a 
universe of size m (^> n), store it efficiently and succinctly so that queries of the form 
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"Is x in ST' can be answered quickly. This fundamental data structure problem has 
been studied earlier in various settings (e.g. by Minsky and Papert ||MP69|| , Yao ||Yao81 



Fredman, Komlos and Szemeredi [ FKS84 |, Pagh [|Pag01 |). Most of these results were in the 
classical deterministic cell probe model. Recently, this problem was considered by Buhrman, 
Miltersen, Radhakrishnan and Venkatesh [[BMRVOOf in the classical bit probe model, which 
was introduced in ||MP69|| ; they studied tradeoffs between storage space and number of 
probes in the classical deterministic case, and also showed lower and upper bounds for the 
storage space when the query algorithm was randomised and made just one bit probe. In 
the classical bit probe model the storage scheme is deterministic and stores the given set 
as a string of bits. The query scheme is either deterministic or randomised and answers 
membership queries probing only one bit of the string at a time. 

In this paper, we allow the query algorithm to perform quantum search on the table of 
bits representing the stored subset S. The storage scheme which encodes S a table of bits, 
T : {0, l} r — > {0, 1}, continues to be classical deterministic. To formalise this, we define 
the quantum bit probe model. The table of 2 r bits, T, is modelled using an oracle unitary 
transformation Ot that takes the basis state \y, b) to \y, b © T(y)). Basically, the table of 
bits is accessed by the query algorithm as in the well-studied quantum black box model (see 
e.g. |BBC + 98|1 ). If the inputs to the oracle Ot are restricted to basis states, Ot reduces to 
(the reversible version of) the classical table that stores one bit for each address in {0, l} r . 
We, therefore, define the space used by the quantum bit probe scheme to be 2 r . 

The main point of departure from the classical model, is in the query algorithm. We 
allow the algorithm to feed a superposition of basis states to the oracle. Each use of the 
oracle counts as one probe of the table, even if a superposition is supplied to the oracle, 
and the output depends on the value of several bits of the underlying table T. In the 
preparation of this superposition and in the processing of the output returned by the 
oracle, we allow arbitrary unitary transformations. It is known that this form of access 
often leads to significant improvements over classical algorithms for several problems (e.g. 
Grover's algorithm ||Gro96|| for searching an unordered database). 

Previously, the number of probes to the black box as a complexity measure had been 
studied in the quantum setting (e.g. ||BBBV97| , [BB(J^98| , |AmbUq , |Gro96|| ). Both lower 
bounds and upper bounds for various problems were proved. The main contribution of 
this paper is the study of tradeoffs between storage space and number of probes for a 
static data structure problem in the quantum setting. For the set membership problem, 
we show that several limitations of classical computation (shown in |p3MRV00|] ) continue 
to persist even if quantum query algorithms are allowed. This is surprising, because for the 
(superficially) similar problem of searching an unordered database, quantum computation 
helps. 

Our tradeoffs between storage space and the number of quantum probes are proved 
using linear algebraic arguments. Roughly speaking, we lower and upper bound the di- 
mension of a set of unitary operators arising from the quantum query algorithm. The lower 
bound on the dimension arises from the 'correctness requirements' of the quantum algo- 
rithm. The upper bound on the dimension arises from limitations on the storage space and 
number of probes. By playing the lower and upper bounds against each other, we get the 
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desired tradeoffs. To the best of our knowledge, this is the first time that linear algebraic 
arguments have been used to prove lower bounds for data structure problems, classical 
or quantum. Counting of dimensions has been previously used in quantum computing 
(see e.g. ||AST + 98j PdWOlj ), but in quite different contexts and ways. Linear algebraic 
arguments similar to ours have been heavily used in combinatorics. For a delightful intro- 



duction, see the book by Babai and Frankl ||BF92 



1.1 Our results 



The exact quantum model: Buhrman et al. ||BMRV00|j have shown, for classical de- 
terministic query algorithms, that any (s, t)-scheme (which uses space s and t bit probes) 
satisfies ( m ) < f s J2 n *. We show a stronger (!) tradeoff result in the quantum bit probe 
model. 

Result 1 Suppose there exists a scheme for storing subsets S of size at most n 
from a universe U of size m that uses s bits of storage and answers membership 
queries, with zero error probability, with t quantum probes. Then, 
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This has two immediate consequences. First, by setting t = 1, we see that if only one probe 
is allowed, then m bits of storage are necessary. (In [BMKV00|, for the classical model, this 
was justified using an ad hoc argument.) Thus, the classical deterministic bit vector scheme 
that stores the characteristic vector of the set S and answers membership queries using one 
bit probe, is optimal even up to quantum. Second, it follows (see pMRV00|j for details) 
that the classical deterministic scheme of Fredman, Komlos and Szemeredi ||FKS84| , which 
uses O(nlogm) bits of storage and answers membership queries using O(logm) bit probes, 
is optimal even up to quantum — quantum schemes that use O(nlogm) bits of storage 
must make fi(logm) probes if n < m l ~ n ^ . Recently, Pagh [ FagO 1|| has shown classical 
deterministic schemes using the information-theoretic minimum space 0(nlog(m/n)) and 
making 0(log(m/n)) bit probes, which is optimal even up to quantum, by the above 
result. For t between 1 and 0(log(m/n)), Buhrman et al. [ [BMRVOOH have given classical 
deterministic schemes making t bit probes, which 0(m 3 ^n log m) bits of storage. A lower 
bound of f^n^m/n) 1 /*) for storage space, for suitable values of the various parameters, 
follows from Result [I]. Thus, if we only care about space up to a polynomial, classical 
deterministic schemes that make t bit probes for t between 1 and 0(log(m/n)), and which 
use storage space almost matching the exact quantum lower bounds, exist. 

Interestingly, the above theorem holds even in the presence of errors, provided the error 
is restricted to positive instances, that is the query algorithm sometimes (with probability 
< 1) returns the answer 'No' for a query x that is actually in the set S. This was not 
observed earlier even in the classical model, although one can easily modify the proof of 



the tradeoff result in [BMRV00| to give this 
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The e-error model: In the classical model, there exists a scheme for storing subsets of 
size at most n from a universe of size m that answers membership queries, with two-sided 
error at most e < 1/16, using just one bit probe, and using storage space 0( nl °f m ). Also, 
any such one probe scheme making two-sided error at most e must use space ^( ^tt )' 
Both the upper bound and the lower bound have been proved in ||BMRV00|| . By two-sided 



error, we mean that the query algorithm can make an error for both positive instances 
(the query element is a member of the stored set), as well as negative instances (the query 
element is not a member of the stored set). Since different sets must be represented by 
different tables, every scheme, no matter how many probes the query algorithm is allowed, 
must use VL{n\og{m / n)) bits of storage, even in the bounded two-sided error quantum 
model. However, one might ask if the dependence of space on e is significantly better 
in the quantum probe model. We show the following lower bound which implies that a 
quantum scheme needs significantly more than the information-theoretic optimal space if 
sub-constant error probabilities are desired. 

Result 2 Let n/m < e < 1/8. Suppose there is a scheme with two-sided error 
e which stores subsets of size at most n from a universe of size m and answers 
membership queries, with two-sided error at most e, using one quantum probe. 
It must use space 

n\og(m/n) 



n 



■V6 log(l/e) 



The method used to prove this result can be generalised to algorithms that use more 
probes than one. 

Result For any p > 1 and n/m < e < 2~ 3p , suppose there is a scheme 
which stores subsets of size at most n from a universe of size m and answers 
membership queries, with two-sided error at most e, using p quantum probes. 
Define 5 — e^ p . It must use space 



n\og(m/n) 
5V6 log(l/5) 



Such a tradeoff between space and error probability for multiple probes was not known 
earlier, even in the classical randomised model. We note that for p bit probes, an upper 
bound of 0( "^°/ p m ) on the storage space, for e < 2 _p , follows by taking the storage scheme 

of |BMRV00|1 for error probability ^p, and repeating the (classical randomised) single 
probe query scheme p times. This diminishes the probability of error to e. Thus, our lower 
bounds for two-sided error quantum schemes roughly match the two-sided error classical 
randomised upper bounds. 



The results described above are inspired by similar results proved earlier in [BMRVOO 
in the classical model. However, the methods used for classical models, which were based on 
combinatorial arguments involving set systems, seem to be powerless in giving the results 
in the quantum model. Our results are based on linear algebraic arguments, involving 
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counting the dimensions of spaces of various operators that arise in the quantum query 
algorithm. 



Bounds for classical models: As stated above, Result [I] is stronger than what was 
known earlier, even in the classical deterministic model. One might wonder if this stronger 
result is somehow easier to prove in the classical deterministic model. We show that 
the linear algebraic techniques used in the proof of Result [I] can be considerably simplified 
when we assume the classical deterministic model, and give the same inequality as stated in 
Result |l| Also, one can easily modify the proof to yield the same tradeoff for randomised 
query schemes where the error is restricted to positive instances (i.e. when the query 
element is a member of the stored set). 

The proof in | BMRV00H of the space lower bound for a classical two-sided e-error ran- 



domised query scheme, namely s = ^( ^"J"^^ ), involved some tricky use of both upper 



and lower bounds for r-cover-free families shown by Nisan and Wigderson [|NW94j| , Erdos 



Frankl and Fiiredi | EFF85|| , Dyachkov and Rykov |pR82|| . The proof of the analogous 



bound (Result §) in the quantum model completely avoids these. In fact, we give a proof 
of a (slightly weaker) lower bound s = ^( ^^w^y ) m t ne classical randomised model by 
adapting the ideas used in the proof of Result |2] to the classical setting. We first diminish 
the error probability of the one-probe query algorithm by repetition and then we can, by 
fixing the random coin tosses, make it a deterministic query algorithm which however, 
uses more than one probe. We then apply our (classical) deterministic space-time tradeoff 
equation to complete the proof. This approach is inspired by our proof of Result ||. Besides 
being simpler, this proof has the advantage that it generalises readily to more than one 
probe. No such result was known earlier in the classical setting. 

Result 3 Let p > 1, 18~ p > e > 1/m 1 / 3 and m 1 / 3 > 18n. Define S = e 1/p . 
Any classical scheme which stores subsets of size n from a universe of size m 
and answers membership queries, with two-sided error at most e, using at most 
p bit probes must use space 



n 



n log m \ 
S 2 / 5 log(l/<5) ) 



1.2 Organisation of the paper 

In the next section, we describe our quantum bit probe model formally and give the 
framework of our proofs. Detailed proofs of all our results (for both quantum and classical 
models) appear in Section |3|. We conclude in Section | with some open problems. 

2 Definitions and notations 

In this section we first describe our quantum bit probe model and then give some formal 
definitions and notations which will be used in the proofs of the theorems. 
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2.1 The model 



Our model is a quantum analogue of the classical bit probe model which has been exten- 



sively used in the past to study data structure problems (see e.g. |[MP69| , |Mil93| , [BMRVOO 

A static data structure problem is a function / : D xQ — > {0, 1}, where D is a finite set 
called the set of data to be stored and Q is a finite set called the set of queries. A classical 
(s, t)-bit probe scheme for a static data structure problem consists of a deterministic storage 
scheme which stores the given data d as a bit string of length s, and a query scheme which 
given a query q makes at most t bit probes to the stored string and computes f(d,q). 
The query scheme can be either deterministic or randomised. For more details about the 
classical model, see ||BMRVO0} . 

A quantum (s, t)-bit probe scheme for a static data structure problem has two com- 
ponents: a classical deterministic storage scheme that stores the data d using s bits, and 
a quantum query scheme that answers queries by 'quantumly probing a bit at a time' at 
most t times. 

The storage scheme: For the set membership problem, the data to be stored is a subset 
S of a universe U (\S\ < n, |U| = m). Let x(S) G {0, 1} S be the bit string that is stored 
by the storage scheme for recording S. The storage scheme is classical deterministic. The 
difference now, is that this bit string is made available to the query algorithm in the form 
of an oracle unitary transform 0$- To define Os formally, we represent the basis states 
of the quantum query circuit as \j,b,z), where j G [s] is a binary string of length logs 
('address qubits'), b is a single bit ('data qubit'), and z is a binary string of some fixed 
length ('work qubits'). Let x(S)j be the bit stored at the jth location in the string x(S). 
The action of Os on a basis state is described below. 

O s :\j,b,z)^(-l) b < s ^\j,b,z) 

Remark: The oracle described in the introduction mapped \j, b, z) to \j, b © x(S)j, z). It 
is known that the oracle Os defined above is equivalent in power to this oracle. 

Thus, information about the string x(S) appears in the phase of the basis states in the 
output, and Os is represented by a diagonal matrix (in the standard computational basis): 
the zth diagonal entry, where i = \j, b, z), is 

{O s ) hl = (-lf<*)i 

For T C [s] and x G {0, 1} S , define [x]t — J2ieT x i ( m °d 2). In particular, [x]$ = 0. Thus, 
(Os)i,i — (— where ^ is some subset of [s] of size 1 (when b = 1, Zj = {j}) or 
(when b = 0, l { = 0). 

Remark: Our model for storage does not permit Os to be any arbitrary unitary transfor- 
mation. However, this restricted form of the oracle is closer to the way bits are accessed 
in the classical case. Moreover, in most previous works, storage has been modelled using 

0550011 ) . 
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The query scheme: Suppose a subset S C U, l^l < n, has been stored and x(S) G 
{0, 1} S is the corresponding bit string. A quantum query scheme with t probes is just a 
sequence of unitary transformations 

Uo _> o s - C/i - O s - . . . C/t-i - 5 - ^ 

where L^-'s are arbitrary unitary transformations that do not depend on the set S stored. 
For a query q G U, the computation starts in an observational basis state |?)|0), where 
we assume that the ancilla qubits are initially in the basis state |0). Then we apply the 
operators Uo, Os, ■ ■ ■ , Os, Ut and measure the final state. The result of the query is the 
rightmost bit of the state obtained by the measurement. The query scheme can be exact 
or have error; the error can be one-sided or two-sided. When the query scheme is exact, 
the measurement of the final state gives the correct answer with probability 1. If one-sided 
error e is allowed, the measurement produces a with probability 1 when the answer is 0, 
but when the answer is 1, is required to produce a 1 with probability only at least 1 — e. If 
two-sided error e is allowed, the answer can be wrong, with probability at most e, for both 
positive and negative instances. 

The framework for the proofs: We now describe the general framework in which the 
various proofs are presented and also give some definitions and notations which will be 
used throughout the paper. 

For a query q G U, define \(p q ) = \q)\0). The set of vectors \4> q ),q G U form an 
orthogonal system of vectors. They are independent of the set S stored. 

Define two Hilbert spaces, A and A±, where Ai is the space of all state vectors that 
can be spanned by basis states having an i at the rightmost bit (i.e. if the state vector 
lies in Ai, then on measuring the rightmost bit at the output, one gets i with probability 
1). Then the entire state space V decomposes as an orthogonal direct sum of the spaces 
A^A-y. 

Define the unitary transformations {Ws} 5CU | 5 | <n as follows. 

W s = UtOsUt-iO s Ut-20 s ■ ■ ■ UiOsUiOsUo 

Thus when a set S is stored, in the exact quantum case, Ws\4>i),i G S lie in A\, 
and Ws\<fti),i S lie in A . In the one-sided e-error case, Ws\4>i),i S lie in A , but 
Ws\4>i),i G S may not lie entirely in Ai, but in fact may have a projection on A of length 
at most ^/e. In the two-sided e-error case, Ws has to send the vector "approximately" 
to the correct space, i.e. the projection of Ws\4>i) on the correct space is of length more 
than Vl — e - 

Notation: In the proofs we have to take tensor products of vectors and matrices. For 
any vector v or matrix M, we have the following notation, 

V ® 1 = y . . ■ y 
t times 
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M m = M <g> • • • ® M 

V v ' 

i times 

We note that since the entire state space V is the orthogonal direct sum of A and A±, 

and the 2* vector spaces in the above direct sum are pairwise orthogonal. 

Below, A A B stands for the symmetric difference of sets A and B; stands for the 
conjugate transpose of the matrix M. 

3 Proofs of theorems 
3.1 Quantum schemes 

We first prove our space v/s probes tradeoff result for exact quantum schemes. 

Theorem 1 Suppose there exists a scheme for storing subsets S of size at most n from 
a universe U of size m that uses s bits of storage and answers membership queries, with 
zero error probability, with t quantum probes. Then, 

±(D<-±§ 

j=0 V 7 1=0 v 7 

Proof: We use the notation of Section |[ For any subset S C U, l^l < n, let us define 

W s = U t O s U t ^O s U t ^Os ■ ■ ■ UzOsUtOsUo 



Claim 1 {W® n }g e ( u \ are linearly independent. 
Proof: Suppose there is a nontrivial linear combination 

£ osHf = 

Let T be a set of largest cardinality such that «t 7^ and let T — {ii, . . . , i^}, k < n. We 
define a vector 

l0T) = |0u) 0(n - fc+1) ®l^)®---®l^ fc ) 

Applying to the linear combination above, we have 

£ a s W® n \<f> T ) + a T W^\<h) = (1) 

For any set S, 

w® n \<j> T ) = (^5i0u>r (n - fc+1) ® w s \^ 2 ) ® ■ ■ ■ ® Wsi&j 
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• If S = T, WtI^} e A x for all Z, 1 < I < k Hence W® n \(j) T ) G Af n . 

• If S 7^ T, there exists an element i,- in T — S 1 (by choice of T). Ws\4>i ) £ A)- Hence 
H/| n |0 T ) Af n . In fact, Wf n |0 T ) is orthogonal to Af n . 

Hence, in the above linear combination (equation |l|), the only vector which has a 
nontrivial projection along Af n is W® n \4>T)- Hence, «t = leading to a contradiction. ■ 

Claim 2 {Wg n } Se rv \ Zie m a vector space of dimension at most Yl^Lo if)- 
Proof: By definition, for any set S, \S\ < n, 

W s = U t O s U t ^O s U t ^Os ■ ■ ■ UiOsUxOsUo 

where U , . . . , U t are unitary transformations (matrices) independent of the set stored. 
For any pair of indices 

(Ws)i,j = E {U t )i > kt-i{Os)kt-i > kt-i{Ut-l)kt-i,kt-2(Os)kt-2,kt-2 
k t -i,...,ko 

■ ■ ■ (Ui) kliko (Os)k ,k {Uo) ko j 

= E (^)^- 1 (-i) ws)] ^ i (^- 1 )^- 1 ^ 2 (-i) N " )1 ^ 2 

k t -i,—,k 

where, recalling the notation of Section |2|, x(S) is the string stored by the storage scheme 
for set S and Zj is either the single location in the string corresponding to index i or the 
empty set. 

Therefore, if we define T kt _ lt ... t k = Zfc t _i A l kt _ 2 A - • • A and [a^S 1 )^ to be the parity 
of the bits stored in x(S) at the locations of T, we have 

(Wskj = £ (-l) [x(S)lTfc --- fco (K)^- 1 (^-i)^ 1 A- 2 ---(^) fcl A(f/o) fc o, J 

fct_i,...,fco 

= e (-i) W5)]t E (^)^- 1 (^-i)^- 1 ^- 2 ---(^)^ 1 feo(^o) fco j 

^ T fet _ 1 ,..., fc0 =T 

Let us define for every set TC [s], \T\ < t, a matrix At as follows: 

(Ar)ij = E (Ut)i,kt-i(Ut-i)kt-i,k t -2 ■ ' ' (Ui)ki,k (Uo)ko,j 

k t _ 1 ,...,k 
T fc t _l,...,fc = T 

Then we have, 

W s = E (-1) [x ' (5)1t ^t (2) 
re(W) 
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Hence, 



\ 



(W s ) m 



(-lf^A Tl 



Tie 



( 



\ 



E (-i) [x{s)]T «A Tn 

J V r ™ e (<t> 



Ki<n 



J2 (-1)^tB 7 

Hit) 



where for f e (^) : 



E ^ 



a 7 



Tj A-AT„=T 

Tie(W),l<i<n 



Hence, we see that {Bf}j, / [»] \ span {W' i f n } S6 ('u\. So, {H / i f" 1 } 5s _ ( ' u \ lie in a vector 

\<nt) \<nj \<rLj 

space of dimension at most Y^=o (!)• ■ 
Now the theorem is an easy consequence of the above two claims. ■ 

Remark: Equation |2| in the proof of Claim above is similar to the statement of a lemma 
of Shi. 



Lemma 1 ( ||ShiOO| , Lemma 2.4] rephrased) Consider a quantum query algorithm with 
initial state vector |0) , with the black box unitary transformation representing a bit string 
x = Xi,...,x s . Let \(f>) be the state vector of the circuit after t queries to the black box. 
Then 



■ > T\ 



Tel 



where the <pT are independent of x. 



Shi proved his lemma using the observation by Beals et al. (see [|BBC + 98| , Lemma 4.1]) 
that the amplitudes of the basis states in the state vector \<f>) are multilinear polynomials 
of degree at most t in X\, . . . , x s . 

The space-time tradeoff equation for the exact quantum case holds for the one-sided 
error case too, as shown below. 

Theorem 2 The tradeoff result of Theorem [7] also holds for a quantum scheme where the 
query scheme may err with probability less than 1 on the positive instances (i.e. if an 
element is present it may be erroneously reported absent), but not on the negative instances 
(i.e. if an element is absent it has to be reported absent). 
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Proof: (Sketch) Essentially, the same proof of Theorem]]] goes through. Since the query 
scheme can make an error only if the element is present, we observe that the only vector 
in the linear combination (equation |l|) that has a non-zero projection on the space Af n , 



is the vector W^ n \ 



!>t). Hence «t = 0, and the operators {Ws} 



SCU,\S\<n 



continue to be 



linearly independent. Hence, the same tradeoff equation holds in this case too. ■ 
We now prove the lower bound on the space used by a two-sided e-error 1-probe quantum 
scheme. 

Theorem 3 Let n/m < e < 1/8. Suppose there is a scheme which stores subsets S of size 
at most n from a universe U of size m that answers membership queries, with two-sided 
error at most e, using one quantum probe. It must use space 



n 



n log(m/n) 
eV6 log(l/e) 



Proof: Since we are looking at a one probe quantum scheme, Ws = U\OsUq. We start by 
picking a family F of sets, F = {Si, . . . , Sk}, Si C U, \Si\ = n and \Si fl Sj\ < n/2 for all 
% ^ j. By picking the sets greedily ||EFF85| , |NW94j| , one obtains a family F with 



\F\ > 



Let t 



4 log \F\ 
nlog(l/(4e)) 



2 2 

Since, m/n > 1/e, 



m m—1 _ _ _ m-n/2+1 / m \ 
^ n n—1 n—n/2+1 ^ \~n ) 



n/2 



m W 2 
An) 



(3) 



41og|F| ^ 4nlog(m/(4n) ^ 



Hence, 



nlog(l/(4e)) ~ 2ralog(l/(4e)) 



41og|F| 41og|F| 61og|F| 

<t<— — 6 U 1 _ + i < 



nlog(l/(4e)) 



nlog(l/(4e)) 



nlog(l/(4e)) 



(4) 



Claim 3 {H^f^jseF fl^e linearly independent. 

Proof: Suppose there is a non-trivial linear combination 

£ a ff W| Bt = 



Fix a T G F. Let T = . . . , i n }. Define 

\<h) = (\<i>h) ® I0< 3 ) ® • • • ® 100)' 

Applying 0^ to the above linear combination, we get 
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a s{W s \K) ® • • • ® W s |0iJ)^ = 
Taking inner product of the above combination with the vector 

w® n *|<M = (WtI^) ® • • • ® Wrl^J)®* 

we get 



£ asC^IWjWrl^) • • • {K\Wlw T \M)* = (5) 



S€F 



For any ^ G S n T, K^lWjWrl^)! < L 



For any G T, WtI^-) = ^0 + ^1 where 1 > ||ui|| > yl — e and ||u || < w o G A 
and t>i G Ai. For any ij E T — S, Ws\4>i ) — u o + M i where 1 > ||w || > \/l ~~ e an d 
|| M i|| < an d «o € A) an d «i G A\. Hence 

K^lWlWrl^)] = \(u \v ) + ( Ul \ Vl )\ 

< \\uq\\ \\v \\ + ||wi|| \\vi\\ 

< 2^=6 



We now note that for every T G F, we have a linear combination as in equation [| above. 
We can write the linear combinations in the matrix form as aM = 0, where a = (as)s^F 
and M is a |F| x \F\ matrix whose rows and columns are indexed by members of F. For 
S T G F 

M(S,T) = ((<P l AW^ s W T \^ il )---^ i jWlW T \ ( p,jy 

where T = ...,i n }. The diagonal entries of M, M(T,T), are 1. The non-diagonal 
entries satisfy \M(S,T)\ < (£)(«-|Snr|)t < 6 nt/2_ 

Using the lower bound on t from we get 

\F\5 tn/2 = |F|(4e) in/4 < 1 

Hence (\F\ — l)(5) tn ^ 2 < 1. This implies that M is non-singular. [Suppose not. Let y be 
a vector such that My = 0. Let i be the location of the largest coordinate of y. We can 
assume without loss of generality that yi — 1. Now, the ith coordinate of the vector My 
is at least 1 — (\F\ — l)(5) tn ^ 2 > in absolute value, which is a contradiction.] So, as = 
for all S 1 G F. Hence {W® nt }seF are linearly independent. ■ 

Claim 4 {W^f n *} 5ei r ^ e ^ n vector space of dimension at most Yl^Lo ( s ) ■ 
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Proof: Similar to proof of Claim |2] in Theorem [I]. 
Using the two claims above, 



/ s\ fs + nt\ ( (s + nt)e x r 



< < 



\p\<y , , _ , 

V 7 / \ nt I \ nt 

j= o v/ v /v 

Using the upper bound on i from (||), we get 

' < F < ' 



4e 7 V n£ 




For values of e such that (l/4e) 1 / 6 > 2e, that is e < 4 1 (2e) 6 , using (||) and the lower 
bound on t from (|j), we get 

1 /6 

^ / 1 \ ' ^ 21og|F| ^ n log(m/4n) 



2e V4ey ~ e(4e) 1 /6 log(l/4e) ~ e(4e)V6 log(l/4e) 

= / w log(m/n) 
^eV6 1og(l/ e ) 

For 4 _1 (2e)~ 6 < e < 1/8, we recall the fact that Q (n log (m/n)) is always a lower bound 
(the information-theoretic lower bound) for the storage space. Thus, for these values of e 
too 

/ ?2log(m/n) 

Hence, the theorem is proved. ■ 
We now show how to extend the above argument for 2-sided e-error quantum schemes 
which make p probes. 

Theorem 4 For any p > 1 and n/m < e < 2~ 3p , suppose there is a scheme which stores 
subsets S of size at most n from a universe U of size m that answers membership queries, 
with two-sided error at most e, using p quantum probes. Define 5 = e 1//p . The scheme must 
use space 

n log(m/n) 



n 



<JV6 log(l/<5) 



Proof: (Sketch) The proof of this theorem is similar to the proof of Theorem |3|. Pick 
a family F of sets, F = {S\, . . . , Sk}, Si C U, \Si\ = n, \Si PI Sj\ < n/2 for all i ^ 

j, such that |F| > (m/4n) n / 2 . One can prove that {W^} S( _ p , t ■ 



41og|F| 



arc 



i nlog(l/(4e)) i 

linearly independent in exactly the same fashion as Claim |3] in Theorem |3| was proved. 
The difference is that {Wf nt } S&F lie in a vector space of dimension at most Y^=o {j} 
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instead of Y^j=o (j)- This statement can be proved just as Claim ^ in Theorem [1] was 
proved. Therefore, by a argument similar to that at the end of the proof of Theorem fj, we 
get a lower bound 

n log (m/n) 



n 



5V6 l g(l/(J) 



3.2 Classical schemes 

We now give the proof for the space-time tradeoff equation in the classical deterministic 
case. 

Theorem 5 Suppose there exists a classical deterministic scheme for storing subsets S 
of size at most n from a universe U of size m which uses s bits of storage and answers 
membership queries with t classical bit probes. Then, 

£(7)^(1) 

i=o v 7 i=0 v 7 

Proof: For 1 < i < m, let /j : {0, 1} S — > R denote the function for query i, which maps 
bit strings of length s to {0, 1} C R i.e. /, maps x G {0, 1} S to 1 iff the query scheme given 
query i and bit string x evaluates to 1. Consider a mapping $ : [ K ) — ► ({0, 1} S — > R) i.e. 
$ takes a subset of the universe of size at most n to a function from bit strings of length 
s to the reals. <3> is defined as follows 

$({}) = constant 1 function 
*(5) = /<i/fa ••■/<*, 5 = {ii > ...<*}, 



Claim 5 {$(S')} s , g ^ u \ are linearly independent over R. 
Proof: Suppose there exists a non-trivial linear combination 



Pick a set T of smallest cardinality such that ay 7^ 0. Let x(T) G {0, 1} S be the string 
stored by the storage scheme. Applying x(T) to the above linear combination, we get 

a S $(S)x(T) = 

HI) 

If S T, there exists an element i G U such that i E S — T. Then, fi(x(T)) = 0, and 
hence, $(S)(x(T)) =0. If S = T, then $(S)(a;(T)) = $(T)(x(T)) = 1. Hence, a T = 
which is a contradiction. Hence the claim is proved. ■ 
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Claim 6 {$(S , )} 5g / u \ lie in a vector space of dimension at most Y^=o Q ■ 



Proof: Since the query scheme is deterministic and makes at most t (classical) bit probes, 
given a query i, 1 < % < m, the function /j is modelled by a decision tree of depth at most 
t. Hence can be represented over R as a sum of products of at most t linear functions, 
where the linear functions are either y^ (representing the value stored at location j in the 
bit string) or 1 — yj (representing the negation of the value stored at location j). Note that 
for any y G {0, 1} S , at most one of these products evaluates to 1. Such a function can be 
represented as a multilinear polynomial in y±, y2, ■ ■ ■ , y s of degree at most t. A product of 
at most n such functions can be represented as a multilinear polynomial of degree at most 
nt. Hence, {$(S')} 5g ^ u ^ lie in the span of at most J2i=o (!) functions from {0, 1} S to R. 
From this, the claim follows. ■ 
From the above two claims, the theorem follows. ■ 
In fact, the tradeoff result can be extended to the one-sided error classical case too. 

Theorem 6 The tradeoff result of Theorem [| also holds for classical schemes where the 
query scheme may err with probability less than 1 on the positive instances (i.e. if an 
element is present it might report it to be absent), but not on the negative instances (i.e, if 
an element is absent it has to be reported as absent). In fact, the tradeoff result holds for 
nondeterministic query schemes too. 

Proof: (Sketch) A proof very similar to that of Theorem |5| goes through. We just 
observe that now the query scheme is a logical disjunction over a family of deterministic 
query schemes. If the query element is present in the set stored, there is a decision tree in 
this family that outputs 1. If the query element is not present in the set stored, then all 
the decision trees output 0. Let us denote by the family of decision trees corresponding 
to query element i, 1 < % < m. For any decision tree D in Fj, let gu : {0, 1} S — > {0, 1} be 
the function it evaluates. 

Let us now define fi = J^DeF 9 D - Then 



With this choice of fi, the rest of the proof is the same as in the deterministic case. ■ 
Now we give a simple proof of the lower bound for the space used by a classical ran- 
domised scheme which answers membership queries with two-sided error at most e and 
uses only one bit probe. 

Theorem 7 Let 1/18 > e > 1/m 1 / 3 and m 1 / 3 > 18n. Any classical scheme which stores 
subsets S of size at most n from a universe U of size m and answers membership queries, 
with two-sided error at most e, using one bit probe must use space 




( 



e 2/5 log(l/ e ) 



n logm 



) 
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Proof: Suppose there is a classical scheme which stores subsets of size n from a universe 
of size m using s bits of storage, and answers membership queries using one bit probe 



with two-sided error at most e. Define k = 



Therefore 



41og(27m) 
31og(l/4ee) 



Since m 1 / 3 > 1/e, t ^/TK > 4 

' ' 31og(l/4ee) — 



41og(27m) <k< 4 log(27m) < 5 log(27m) 
31og(l/4ee) ~ ~ 31og(l/4ee) ~ 31og(l/4ee) [ ' 

We repeat the query scheme k times and accept only if more than 3fc/4 trials accept. 
Then the probability of making an error on a positive instance (i.e. the query element is 
present in the set stored) is bounded by 

^Je fe/4 <(4e) fc / 4 e fc / 4 = (4ee) fc / 4 

The probability of making an error on a negative instance (i.e. the query element is not 
present in the set stored) is bounded by 



From lower bound on k from (|6|), we get 

Pr [Error on a positive instance] < (4ee) fc / 4 < 7^3173 < 3^ 
Pr [Error on a negative instance] < (4ee) 3fc//4 < 

Hence, the probability that a random sequence of coin tosses gives the wrong answer on 
some query q G U and a particular set S stored, is at most 

1 1 . . 1 

— x n H x [m — n) < - 

3n 27m v ; 2 

Call a sequence of coin tosses bad for a set S, if when S is stored, there is one query 
q G U for which the query scheme with these coin tosses gives the wrong answer. Thus, at 
most half of the coin toss sequences are bad for a fixed set S. By an averaging argument, 
there exists a sequence of coin tosses which is bad for at most half of the sets S G (Y) ■ By 
setting the coin tosses to that sequence, we now get a deterministic scheme which answers 
membership queries correctly for at least half the sets S G and uses k bit probes. 
From the proof of Theorem |5|, we have that 

nk / N / , x // 1 \ \ nk 



E 



< > < . < 



1 f m\ ^ ( s\ ^ fs + nk\ ^ ( e(s + nk) 

nk 



2\nJ ^— ' \ i I \ nk I \ nk 



1 /m\ n ^ f e(s + nk) 



2 V n J \ nk 
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Using the upper bound on k in (|6]) and the fact that m 1 / 3 > 18n, we get 




3fc/5 x 



2n/3 



, , < (27m) 2n/3 = (9m 2 ^) n < - (l8m 2 / 3 f <-(- 

-, \ 2nk/5 / i i \ \ nfc 

1 \ / e(s + nk) 



< , 

4ee / \ nk 




Arguing as in the last part of the proof of Theorem |3], and recalling that since m 1 / 3 > 18n, 
Q(n\ogm) is always a lower bound (the information-theoretic lower bound) for the storage 
space, we get 

nlogm 



e 2 / 5 log(l/e) 



We can extend the classical randomised two-sided error space lower bound above to the 
case of multiple bit probes. 

Theorem 8 Letp > 1, 18~ p > e > 1/m 1 / 3 andm 1 ^ > 18ra. Define 5 — e^ p . Any classical 
scheme which stores subsets S of size at most n from a universe U of size m and answers 
membership queries, with two-sided error at most e, using at most p bit probes must use 
space 

( nlogm \ 
U 2 / 5 log(l/5) J 

Proof: (Sketch) The proof of this theorem is similar to the proof of Theorem [7] above. 
We repeat the query scheme k 



times and accept only if more than 3k/ '4 



4 log(27m) 
31og(l/4ee) 

trials accept. We "derandomise" the new query scheme in a manner similar to what was 
done in the proof of Theorem [7|. We thus get a deterministic query scheme making kp bit 
probes and answering membership queries correctly for at least half the sets S G ( ) . The 
rest of the proof now follows in the same fashion as the proof of Theorem [7[ ■ 



4 Conclusion and open problems 

In this paper, we introduce the quantum bit probe model and study the complexity of the 
static membership problem in this model. We study the problem in the exact and bounded 
(one-sided and two-sided) error versions of the model and give lower bounds which almost 
match the corresponding classical upper bounds. We also give stronger /simplified proofs 
of lower bounds for the problem in the classical setting. 



17 



The paper of Buhrman et al. [[BMRVOOfl also considers classical schemes for the static 
membership problem where the error is bounded and restricted only to negative instances 
(i.e. when the query element is not a member of the stored set). For such schemes, they 
give almost matching upper and lower bounds. But for negative one-sided error quantum 
schemes, we can only prove similar lower bounds as for two-sided error quantum schemes. 
Also, we do not know if there are negative one-sided error quantum schemes better than 
the classical ones in |[BMRV0qi . Thus there is a gap between the upper and lower bounds 



here, and resolving it is an open problem. 

The classical bit probe model has been used in the past to study other static data struc 
ture problems like, for example, perfect hashing and element containment (see e.g. [PagOl 



Mil93|| ). The complexity of these problems in the quantum bit probe model is an important 



open problem. 
Acknowledgements 

We thank the anonymous referees for their comments, which have helped us to improve the 
presentation of the paper and fix some bugs in the parameters appearing in some theorems. 
We also thank them for pointing out the similarities between Claim |2| in Theorem [I], and 
the observations of Beals et al. [|BBC + 98|j and Shi [|ShiOO . 



References 



[AmbOO] A. Ambainis. Quantum lower bounds by quantum arguments. In Proceedings 
of the 32nd Annual ACM Symposium on Theory of Computing, pages 636-643, 
2000. Also quant-ph/0002066. 

[AST + 98] A. Ambainis, L. J. Schulman, A. Ta-Shma, U. Vazirani, and A. Wigderson. The 
quantum communication complexity of sampling. In Proceedings of the 39th 
Annual IEEE Symposium on Foundations of Computer Science, pages 342-351, 
1998. 

[BBBV97] C. Bennett, E. Bernstein, G. Brassard, and U. Vazirani. Strengths and weak- 
nesses of quantum computation. SIAM Journal of Computing, 26(3):1510-1523, 
1997. Also quant-ph/9701001. 

[BBC + 98] R. Beals, H. Buhrman, R. Cleve, M. Mosca, and R. de Wolf. Quantum lower 
bounds by polynomials. In Proceedings of the 39th Annual IEEE Symposium on 
Foundations of Computer Science, pages 352-361, 1998. Full version to appear 
in the Journal of the ACM. Also quant-ph/9802049. 

[BdWOl] H. Buhrman and R. de Wolf. Communication complexity lower bounds by 
polynomials. In Proceedings of the 16th Annual Conference on Computational 
Complexity, pages 120-130, 2001. Also cs.CC/9910010. 



18 



[BF92] L. Babai and P. Frankl. Linear Algebra Methods in Combinatorics (with applica- 
tions to Geometry and Computer Science). Preliminary Version 2, Department 
of Computer Science, The University of Chicago, September 1992. 

[BMRVOO] H. Buhrman, P. B. Miltersen, J. Radhakrishnan, and S. Venkatesh. Are bitvec- 
tors optimal? In Proceedings of the 32nd Annual ACM Symposium on Theory 
of Computing, pages 449-458, 2000. 

[DR82] A. Dyachkov and V. Rykov. Bounds on the length of disjunctive codes. Prob- 
lemy Peredachi Informatsii, 18(3):7— 13, 1982. (In Russian). 

[EFF85] P. Erdos, P. Frankl, and Z. Fiiredi. Families of finite sets in which no set is 
covered by the union of r others. Israel Journal of Mathematics, 51:79-89, 
1985. 

[FKS84] M. Fredman, J. Komlos, and E. Szemeredi. Storing a sparse table with 0(1) 
worst case access time. Journal of the Association for Computing Machinery, 
31(3):538-544, 1984. 

[Gro96] L. Grover. A fast quantum mechanical algorithm for database search. In 
Proceedings of the 28th Annual ACM Symposium on Theory of Computing, 
pages 212-219, 1996. Also quant-ph/9605043. 

[Mil93] P. B. Miltersen. The bitprobe complexity measure revisited. In Proceedings 
of the 10th Annual Symposium on Theoretical Aspects of Computer Science, 
pages 662-671, 1993. 

[MP69] M. Minsky and S. Papert. Perceptrons. MIT Press, Cambridge, Mass., USA, 
1969. 

[NW94] N. Nisan and A. Wigderson. Hardness vs randomness. Journal of Computer 
and System Sciences, 49:149-167, 1994. 

[PagOl] R. Pagh. On the cell probe complexity of membership and perfect hashing. 

In Proceedings of the 33rd Annual ACM Symposium on Theory of Computing, 
pages 425-432, 2001. 

[ShiOO] Y. Shi. Lower bounds of quantum black-box complexity and degree of approx- 
imating polynomials by influence of boolean variables. Information Processing 
Letters, 75 (1-2): 79-83, 2000. Also quant-ph/9904107. 

[Yao81] A. C-C. Yao. Should tables be sorted? Journal of the Association for Com- 
puting Machinery, 28(3):615-628, 1981. 



19 



